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Abstract. The parameter fit from a model grid is limited by our capability to reduce the number 
of models, taking into account the number of parameters and the non linear variation of the models 
with the parameters. The Local MultiLinear Regression (LMLR) algorithms allow one to fit linearly 
the data in a local environment. The MATISSE algorithm, developed in the context of the estimation 
of stellar parameters from the Gaia RVS spectra, is connected to this class of estimators. A two-steps 
procedure was introduced. A raw parameter estimation is first done in order to localize the parameter 
environment. The parameters are then estimated by projection on specific vectors computed for 
an optimal estimation. The MATISSE method is compared to the estimation using the objective 
analysis. In this framework, the kernel choice plays an important role. The environment needed for 
the parameter estimation can result from it. The determination of a first parameter set can be also 
avoided for this analysis. These procedures based on a local projection can be fruitfully applied to 
non linear parameter estimation if the number of data sets to be fitted is greater than the number of 
models. 
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MODELS AND PARAMETERS. 

The comparison between observations and models obtained from simulations is nowa- 
days a common activity in astrophysics. Each model is defined by a set of physical pa- 
rameters 0. It carries out a set of data which can be reduced to equivalent observational 
measurements 5, taking into account the properties of the used instrumentation. Detector 
noise and instrumental uncontrolled variations lead to associate to observational data O a 
conditional probability density function (PDF) p{0/S). Currently, this PDF is admitted 
to be Gaussian. 

The application of the Maximum Likelihood (ML) principle ifP] leads to get the 
parameter values which maximize p{0/S). Taking into account the hypothesis of 
a Gaussian noise, this is equivalent to determine the parameters which minimize a 
quadratic distance between the observations and their simulations. A large literature ex- 
ists on the least mean squares (LMS) method [18]. In case of an analytical model, normal 
equations are derived and the problem is reduced to a classical numerical problem. Gen- 
erally, the models carry out equivalent observational data which cannot be expressed as 
analytical functions of the physical parameters. Each comparison needs to compute a 
model, which can be very CPU time consuming. In order to reduce the cost, the models 
are computed on a parameter grid, its sampling being as large as possible such that the 
parameter extraction results from an interpolation in this space. 



In the framework of the determination of the atmospheric parameters from the stellar 
spectra obtained with the Gaia RVS [17] we developed a new algorithm called MATISSE 
(16]. This algorithm is based on the determination of the parameter set by projection of 
the observed spectrum O to a set of vectors Bi computed for each parameter Gj. MA- 
TISSE belongs to a class of statistical methods which allows one to estimate parameters 
from of an optimal projection on a local environment. The projection vectors derive from 
a multilinear regression which takes only into account the models locally computed. 



THE DISTANCE MINIMIZATION AND THE MATISSE METHOD. 

The parameter estimation problem consists into finding the minimum distance D(0) 
between the observable O, here an observed spectrum, and a peculiar model from a 
theoretical grid, here a theoretical spectrum S. In order to simplify the presentation it is 
admitted that the noise variance is constant and that the measurements are independent. 
Therefore, the considered distance is: 

D{@)= £ [0{1)-S{1,@)]\ (1) 

/=1,L 

The resulting normal equation for a given parameter 0,- can be derived. The related 
equation system is generally non linear. To solve it, one needs to start from a given 
parameter set 0^°) and to linearize it around these values. The corrections can be large 
so that an iteration could be necessary. There is no guarantee that the values after 
convergence correspond to the absolute minimum distance. The raw solving way would 
consist into computing a set of models with a sufficient grid resolution for avoiding 
non convexity of the distance function in a given mesh. All the distances to the models 
are computed and the absolute minimum is localized. In a convex region the distance 
function is quite a quadratic function of the parameters and a simple linearization is 
sufficient. However, for models associated to many parameters which can vary on a large 
dynamical range, this procedure could be too heavy. Many algorithms were proposed for 
avoiding the computation of the whole model grid. Since the 70's, genetic algorithms 
were recognized as a fine way for solving optimization problems This technique was 
applied for stellar parameter estimation from spectrograms . Simulated annealing [ 10 ] 



was also developed and appeared to be well adapted to carry out parameter estimation 
dsj]. Artificial neural networks (ANN) were also used for this specific problem [0]. 

Let us come back to the normal equations. The parameter corrections can be written 
as: 

So we can write the relation: 

5@ = G^[T{0-S)]. (3) 



where — S designs the residual vector (length L), T the Lx I model derivative matrix 
and G the 7x7 Gram-Schmidt matrix of the model derivatives. With Equation ([3]), the 



parameter corrections are expressed as linear combinations of the model residuals. The 
associated coefficients depend on the partial derivatives. In our studied problem only 
computed models on a grid are available. Approximated derivatives can be computed 
but it is better to embed this approximation into a more general scheme taking into ac- 
count the grid sampling. In this scheme, the MATrix Inversion for Spectral SynthEsis 
(MATISSE [16]) method, the algorithm determines a vector, 5/, allowing one to derive 
a particular parameter 0/ by projection of an input observed spectrum on it. The 5, vec- 
tor is derived from an optimal linear combination of models in a local environment, the 
environment depending on the model linearity versus the parameters. 5,- is determined 
from the principle that the statistical correlation between the input and the output pa- 
rameter values is maximum. With a spectra energy normalization this is equivalent to 
minimize the distance between the input and the output parameter values. 

First of all, the data on a particular 0,; parameter and the spectra of the grid are 
subtracted by their mean values. The 5,- vector is then constructed from: 

B^{l)= £ aijSj{l). (4) 

6i is estimated by the spectrum projection: 

0,=£5,(/)O(Z). (5) 
/ 

In other words, is estimated by multilinear regressions done on the spectrum values. 
Equation dH) is applied to all the model spectra Sj{l). Combining with Equation ([5]) we 
obtain : 

0. = Ca„ (6) 

where C = [cjf] is the correlation matrix of the spectra and 0, the vector of the parameter 
i for all the spectra. For an invertible variance-covariance matrix, we get: 

a,=C^0„ (7) 

which leads for the training set to: 

&i = SB = S{Sai)=CC^&i = &i. (8) 

The exact values are restored. In order to be sure that a peculiar model was not 
forgotten for the basis building a redundancy is necessary. That leads to a non invertible 
matrix. The LMS solution between the input parameters and their estimated ones leads 
to: 

C^Ca, = C^0/. (9) 

Even if C is singular, the multiplication by its joint matrix brings a solution in the 
LMS sense. The inversion can be done, for example, using the classical Landweber 
iterative algorithm (Landweber, 1951). 

The multilinear regression model stands only if the model is quite linear. Generally it 
is not the case and it is needed to process in two steps (Fig. [B: 

1. A preliminary estimation of the parameters is performed through the use of Bq^ 
functions computed from spectra combinations spanning the parameter range of 
the whole grid. 



2. This first estimation allows one to focus into a subregion of the model grid, where 

(f) 

new BA functions constructed from a subset of theoretical spectra are applied. 




FIGURE 1. Scheme of the MATISSE algorithm for the estimation of the stellar parameters. 



THE OBJECTIVE ANALYSIS WITH KERNEL. 

The Objective Analysis recovers a large set of interpolation methods developed in the 
framework of Geosciences. The function f{x) is known on J points {xj, j E (1,^)) and it 
is interpolated with the rule: 

y , f(Xj) 

fix) = ^ \ (10) 



Many determinist or statistical methods, like the kriging [|14l1 11211 . were proposed for 
the determination of the coefficients Wj{x). The weights can be directly obtained from a 
Kernel function K{^) [II], a being a scaling parameter. The Nadaray a- Watson weighted 
average [JJ consists in the following rule: 



In order to obtain a true interpolation for which the values are exactly recovered at the 
mesh points, the set f{xj) such that: 

is determined. Equation (fT2l) is solved iteratively. The interpolation rule on the parame- 
ters 9i is thus: 

m = ^^-^^^^^'^ (13) 

where the O/y are previously determined by inversion. 
Let us consider the Epanechnikov kernel [jsj]: 

K{x) = ^{\-x^) if \x\<l otherwise K{x)=0. (14) 

Taking into that the spectra normalization the weights can be simplified as: 

Wjf=H{cjf-c), (15) 

where c is a correlation threshold and H{x) the Heaviside function (H{x) = I if x >0 
and null otherwise). The weights for a spectrum O are thus: 

Wj = H{cj -c)= HC£ 0(1)5 j{l) - c). (16) 

The parameter 0, is estimated by the relation: 

. ^ LjH{LiOil)Sj{l)-c)0ij 

' LjH{LiO{l)Sj{l)~c) ■ ^ ^ 

Defining e{0) as the set of spectra Sj such that cj > c, relation (fTTl) can be written as: 

^ 'LlO{l)^j^e{0)Si{l)Qii\-c'Ljee{0)^ii 

Qi = ^ , (18) 

where W = Ejee(o) ^j- Thus it results that: 

0, = £O(Z)5f(Z)-0f (19) 

where 5f(/) = ■^T.jee{0)'^ = i^zMil^ parameters 0/y are deter- 
mined by the inversion of the equation: 

T.rH(cjj' — c)0ij 
0. . = \JJ . (20) 



TABLE 1. Experimental results on a reduced grid. 



Method 



T-max (7(r) logg-max a{\ogg) jx-max (t(/x) 



Kernel at 0.995 
MATISSE BO 
MATISSE BO+Bf 
BO+Bf 99% stars 



122 
1402 
1709 



0. 



13 
451 
104 

0. 



2.95 
6. 
0. 



0.42 



0.06 
0.69 
0.19 
0. 



0.17 
1.32 
1.34 
0. 



0.03 
0.26 
0.08 
0. 



This equation, with a null threshold, is equivalent to the one associated to the MA- 
TISSE method, giving the Of,- value (Equation (6]). In the MATISSE case, the environment 
is defined from the grid sampling. Here the environment results from the correlation 
sampling for the objective analysis. The estimation is done by a correlation product, 
with a function which varies according to the correlations. It can be noted that the 5? 
varies for a spectrum to another, but the contributions associated to each spectrum are 
computed in one step. 



A set of 1386 spectra with 971 Gaia RVS spectral elements were computed. In this grid, 
3 physical parameters vary, the temperature T, the surface gravity logg and the metal- 
licity ji. Experiments were done with different kernels. In the case of the Epanechnikov 
kernel, different correlation thresholds were examined. For the studied grid, the environ- 
ment, defined by the number of remaining spectra for each one, seems to be sufficient at 
a 0.995 threshold. In Table [Da comparison is presented with different experiments: 

Kernel The parameters are restored using the Nadaraya- Watson weighting average, 
with the Epanechnikov kernel. The correlation threshold is equal to 0.995. The 
environment varies from 2 to 68 spectra with this value. The minimum number is 
obtained at points on the grid bound, for which the environment is insufficient to 
get an accurate value. 

MATISSE BO The parameters are determined by one projection. The maximal errors 

are greater than the grid steps. This operation allows one only to localize the 

environment for obtaining the correct values. 
MATISSE BO+Bf After localization, a projection is done for obtaining correct values. 

Few iterations may be needed in order to get the environment corresponding to the 

parameters. 

MATISSE with rejection The previous algorithm works perfectly on 99% of the grid 
spectra. The errors are related to the grid boundary. At the first step the estimated 
parameters may be too far from real ones. The Bf corrections do not allow us to 
converge correctly. A simple test based on the correlation between the restored 
spectrum and the observed one allows us to reject the bad estimations. 

These experiments show that the kernel method is well designed for a first estimation 
of the parameter values on a reduced grid. It is necessary for that to compute theoretically 



EXPERIMENTATIONS ON A SPECTRA SET. 



all the correlations. Specific algorithms have to be developed for reducing this number 
taking into account the correlation threshold. 

CONCLUSION. 

The MATISSE method allows one to improve model fitting in the case of: 

• A capability to describe observations by a suitable model depending of few param- 
eters; 

• The models can not be analytically manipulated, avoiding to use their derivatives 

for the estimations; 

• The need to fit a huge number of observational data. 

The main part of the computation is related to the basis estimation, the parameter 
estimation itself being very fast. The use of the Epanechnikov's kernel method allows 
one to get in one inversion all the coefficients. The number of spectra having a correlation 
above the threshold according to a given grid point indicates if locally the sampling is in 
agreement with the searched accuracy. 

Taking into account their capabilities to process many sets of astronomical observa- 
tions, with a high statistical consistence level, the estimation methods based on local 
projections like MATISSE open a new window on astrophysical modelization for large 
astrophysical surveys. 
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